Ageing Voices: The Effect of Changes in Voice Parameters on ASR Performance
نویسندگان
چکیده
With ageing, human voices undergo several changes which are typically characterized by increased hoarseness and changes in articulation patterns. In this study, we have examined the effect on Automatic Speech Recognition (ASR) and found that the Word Error Rates (WER) on older voices is about 9% absolute higher compared to those of adult voices. Subsequently, we compared several voice source parameters including fundamental frequency, jitter, shimmer, harmonicity and cepstral peak prominence of adult and older males. Several of these parameters show statistically significant difference for the two groups. However, artificially increasing jitter and shimmer measures do not effect the ASR accuracies significantly. Artificially lowering the fundamental frequency degrades the ASR performance marginally but this drop in performance can be overcome to some extent using Vocal Tract Length Normalisation (VTLN). Overall, we observe that the changes in the voice source parameters do not have a significant impact on ASR performance. Comparison of the likelihood scores of all the phonemes for the two age groups show that there is a systematic mismatch in the acoustic space of the two age groups. Comparison of the phoneme recognition rates show that mid vowels, nasals and phonemes that depend on the ability to create constrictions with tongue tip for articulation are more affected by ageing than other phonemes.
منابع مشابه
Longitudinal study of ASR performance on ageing voices
This paper presents the results of a longitudinal study of ASR performance on ageing voices. Experiments were conducted on the audio recordings of the proceedings of the Supreme Court Of The United States (SCOTUS). Results show that the Automatic Speech Recognition (ASR) Word Error Rates (WERs) for elderly voices are significantly higher than those of adult voices. The word error rate increases...
متن کاملWhistleblowing Need not Occur if Internal Voices Are Heard: From Deaf Effect to Hearer Courage; Comment on “Cultures of Silence and Cultures of Voice: The Role of Whistleblowing in Healthcare Organisations”
Whistleblowing by health professionals is an infrequent and extraordinary event and need not occur if internal voices are heard. Mannion and Davies’ editorial on “Cultures of Silence and Cultures of Voice: The Role of Whistleblowing in Healthcare Organisations” asks the question whether whistleblowing ameliorates or exacerbates the ‘deaf effect’ prevalent in healthcare organisations. This comme...
متن کاملThe Effect of Vocal Loudness on Nasalance of Vowels in Persian Adults
Objectives: Nasality is one of the important parameters in pathology of voice resonance. Voice of normal adults has nasality to some extent. It appears that nasality, like other parameters of voice, can be affected by loudness which can be measured in experimental evaluations. This study was conducted to determine the effect of vocal loudness on nasalance of vowels in normal adults and to ident...
متن کاملImmediate effects of vocal warm-up exercises on elementary teachers' voice
Introduction: Teachers are a large group of professional voice users who are exposed to many voice problems. Vocal warm-up exercises (VWUE) can prepare the muscles involved in vocalization before teaching and can reduce voice damage in teachers. However, limited studies have examined the effects of VWUE on teachers' voices. Therefore, the present study was conducted to investigate the immediate...
متن کاملP41: The Effects of Voice-Induced Stress on Mice Testes Parameters
Reports indicate that one of the causes of harmful to the genital system, especially the testes stress. Stress could be created secondarily after some pathological conditions such as neurological diseases or environmental factors. One of the causes of stress could be scary voices, such as cat voice for mice. In this study we aimed to investigate the testes parameters in the animals that were ex...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- EURASIP J. Audio, Speech and Music Processing
دوره 2010 شماره
صفحات -
تاریخ انتشار 2010